Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

A Method for Text Localization and Recognition in Real-World Images

Identifieur interne : 000524 ( Main/Exploration ); précédent : 000523; suivant : 000525

A Method for Text Localization and Recognition in Real-World Images

Auteurs : Lukas Neumann [République tchèque] ; Jiri Matas [République tchèque]

Source :

RBID : ISTEX:285364B6623C7301C6B9380A708BED60EE238BBF

Abstract

Abstract: A general method for text localization and recognition in real-world images is presented. The proposed method is novel, as it (i) departs from a strict feed-forward pipeline and replaces it by a hypotheses-verification framework simultaneously processing multiple text line hypotheses, (ii) uses synthetic fonts to train the algorithm eliminating the need for time-consuming acquisition and labeling of real-world training data and (iii) exploits Maximally Stable Extremal Regions (MSERs) which provides robustness to geometric and illumination conditions. The performance of the method is evaluated on two standard datasets. On the Char74k dataset, a recognition rate of 72% is achieved, 18% higher than the state-of-the-art. The paper is first to report both text detection and recognition results on the standard and rather challenging ICDAR 2003 dataset. The text localization works for number of alphabets and the method is easily adapted to recognition of other scripts, e.g. cyrillics.

Url:
DOI: 10.1007/978-3-642-19318-7_60


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">A Method for Text Localization and Recognition in Real-World Images</title>
<author>
<name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
</author>
<author>
<name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:285364B6623C7301C6B9380A708BED60EE238BBF</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-19318-7_60</idno>
<idno type="url">https://api.istex.fr/document/285364B6623C7301C6B9380A708BED60EE238BBF/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000217</idno>
<idno type="wicri:Area/Istex/Curation">000214</idno>
<idno type="wicri:Area/Istex/Checkpoint">000181</idno>
<idno type="wicri:doubleKey">0302-9743:2011:Neumann L:a:method:for</idno>
<idno type="wicri:Area/Main/Merge">000530</idno>
<idno type="wicri:Area/Main/Curation">000524</idno>
<idno type="wicri:Area/Main/Exploration">000524</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">A Method for Text Localization and Recognition in Real-World Images</title>
<author>
<name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
<affiliation wicri:level="3">
<country xml:lang="fr">République tchèque</country>
<wicri:regionArea>Center for Machine Perception, Czech Technical University, Prague</wicri:regionArea>
<placeName>
<settlement type="city">Prague</settlement>
<region type="région" nuts="2">Bohême centrale</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
<affiliation wicri:level="3">
<country xml:lang="fr">République tchèque</country>
<wicri:regionArea>Center for Machine Perception, Czech Technical University, Prague</wicri:regionArea>
<placeName>
<settlement type="city">Prague</settlement>
<region type="région" nuts="2">Bohême centrale</region>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2011</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">285364B6623C7301C6B9380A708BED60EE238BBF</idno>
<idno type="DOI">10.1007/978-3-642-19318-7_60</idno>
<idno type="ChapterID">60</idno>
<idno type="ChapterID">Chap60</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: A general method for text localization and recognition in real-world images is presented. The proposed method is novel, as it (i) departs from a strict feed-forward pipeline and replaces it by a hypotheses-verification framework simultaneously processing multiple text line hypotheses, (ii) uses synthetic fonts to train the algorithm eliminating the need for time-consuming acquisition and labeling of real-world training data and (iii) exploits Maximally Stable Extremal Regions (MSERs) which provides robustness to geometric and illumination conditions. The performance of the method is evaluated on two standard datasets. On the Char74k dataset, a recognition rate of 72% is achieved, 18% higher than the state-of-the-art. The paper is first to report both text detection and recognition results on the standard and rather challenging ICDAR 2003 dataset. The text localization works for number of alphabets and the method is easily adapted to recognition of other scripts, e.g. cyrillics.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>République tchèque</li>
</country>
<region>
<li>Bohême centrale</li>
</region>
<settlement>
<li>Prague</li>
</settlement>
</list>
<tree>
<country name="République tchèque">
<region name="Bohême centrale">
<name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
</region>
<name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000524 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000524 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:285364B6623C7301C6B9380A708BED60EE238BBF
   |texte=   A Method for Text Localization and Recognition in Real-World Images
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024